Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
2328 | 124 | 2,270 |
2689 | 105 | 1,000 |
3010 | 91 | 1,625 |
3594 | 72 | 4,240 |
3808 | 66 | 2,000 |
4060 | 60 | 5,000 |
4300 | 55 | 10,000 |
4601 | 50 | 1,990 |
4663 | 49 | 3,000 |
4726 | 48 | 2,970 |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
19802 | 4 | 100% |
27933 | 2 | 40% |
27984 | 2 | 50% |
28080 | 2 | 80% |
37378 | 1 | 10% |
38114 | 1 | 2% |
38212 | 1 | 2.1% |
38381 | 1 | 21% |
38431 | 1 | 22% |
38807 | 1 | 3.7% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
5530 | 38 | R&D |
16369 | 6 | E&E |
18624 | 5 | R&D&C |
20331 | 4 | E&P |
22790 | 3 | 16&17 |
25016 | 3 | R&B |
25017 | 3 | R&R |
29602 | 2 | F&B |
31004 | 2 | M&E |
32594 | 2 | S&T |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
8433 | 19 | AS$1 |
16248 | 6 | B$10 |
16249 | 6 | B$500 |
16983 | 6 | US$1 |
17855 | 5 | B$100 |
17856 | 5 | B$65 |
18863 | 5 | US$2 |
19961 | 4 | AS$2 |
19962 | 4 | AS$50,000 |
19963 | 4 | AS$8 |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
478 | 661 | Al-Qur'an |
1948 | 156 | Da'wah |
2147 | 138 | Wata'ala |
6001 | 33 | Ra'es |
6518 | 29 | Qur'an |
8485 | 19 | Ma'had |
10676 | 13 | Sa'adul |
11098 | 12 | Ja'afar |
11728 | 11 | King's |
11865 | 11 | Syar'ie |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
9006 | 17 | ASEAN+3 |
24767 | 3 | PPPS+M |
37379 | 1 | 10+3 |
38115 | 1 | 2+6 |
39009 | 1 | 4+1 |
39633 | 1 | 6738716001/+6738716002 |
40260 | 1 | AFT+3 |
40305 | 1 | AMAF+3 |
40422 | 1 | ASEAN+1 |
40423 | 1 | ASEAN+China |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
1850 | 167 | BDTVEC/BTEC |
5599 | 37 | Kepujian/Kredit |
6796 | 27 | Melayu/Melanau |
7272 | 24 | 2009/2010 |
7463 | 23 | 2010/2011 |
8169 | 20 | 1432H/2011M |
8991 | 17 | 2011/2012 |
9663 | 15 | 1431H/2010M |
10275 | 14 | dan/atau |
11068 | 12 | HIV/AIDS |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have very low frequency, this may indicate a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
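
The same check can be repeated for each of the characters listed above. The following is only a minimal sketch, not the tooling used for this report: it assumes a SQLite copy of the `words` table with columns `w_id`, `word`, and `freq` (the source query appears to target MySQL), and the helper names `like_pattern`, `words_containing`, and `make_demo_db` are hypothetical. Because `%` and `_` are wildcards in `LIKE`, they are escaped explicitly; the `ORDER BY w_id` clause is an added assumption to make the output order deterministic.

```python
import sqlite3

# Characters listed in the text; '%' and '_' double as LIKE wildcards.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch: str) -> str:
    """Return a LIKE pattern matching words that contain ch literally."""
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch  # escape wildcard characters (and the escape char itself)
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) for up to `limit` words containing ch."""
    sql = (
        "SELECT w_id - 100 AS wordlist_rank, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def make_demo_db():
    """Toy in-memory table (assumed schema); rows taken from the tables
    in this section, with w_id = rank + 100 as implied by the query."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    rows = [
        (2428, "2,270", 124),
        (5630, "R&D", 38),
        (11168, "HIV/AIDS", 12),
        (19902, "100%", 4),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = make_demo_db()
    for ch in SPECIAL_CHARS:
        for row in words_containing(conn, ch):
            print(repr(ch), row)
```

Without the escape, a pattern such as `%_%` would match every non-empty word, so the `_` check would report the top of the word list rather than words that actually contain an underscore.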